request interception #930

sjorsdonkers · 2025-08-07T14:58:22Z

Depends on: #922
Also reenables: http_request_start

TODO:

Enable: patterns
continueWithAuth also Enable.handleAuthRequests
fulfillRequest
InterceptResponse, also in continueRequest
continueRequest.postData
continueRequest.headers
Common headers and start_callback before intercept
Confirm string lifetime of waiting Requests

Example playwright fragment:

const page = await context.newPage();

await page.route('**/*', (route, request) => {
  console.log(`Request ROUTE: ${request.method()} ${request.url()}`);
  route.continue({url: "http://lightpanda.io/", method: "GET"});// or route.abort();
});

await page.goto('/campfire-commerce/');

karlseguin

I know this is still in draft.

I like how little impact this has on the http/*.

Do you think it's possible to re-add the http_request_fail and http_request_complete? We could then merge this into nonblocking_libcurl and then into main, as these 2 (+http_request_start) are all that's missing.

src/cdp/domains/fetch.zig

src/browser/browser.zig

src/cdp/domains/fetch.zig

sjorsdonkers · 2025-08-08T08:18:23Z

I like how little impact this has on the http/*.

My main concern, and what I got stuck on for while before ignoring it, is dealing with the start_callback. My understanding is that it can modify the headers and the body, but I'm not sure why it happens as a callback as opposed to when creating the Request.

Do you think it's possible to re-add the http_request_fail and http_request_complete? We could then merge this into nonblocking_libcurl and then into main, as these 2 (+http_request_start) are all that's missing.

I'm OK with both adding them here or taking the http_request_start out and moving it into nonblocking_libcurl. We'll see whatever turns out more convenient.

karlseguin · 2025-08-08T08:35:28Z

The only thing using the startCallback is XHR (ScriptManager is only using to log). I was going to say we can remove it..but...

XHR does two things with this:
1 - it sets the headers. This we could replace. The Request already allows a header to be passed.

2 - it gets a reference to the transfer, which it needs in case xhr.abort() is called. The only solution I can think for this is that request() would return some opaque identifier (probably the request id) and then we can cancel based on the request_id, instead of the transfer. Requires Client to maintain a list of id => transfer which is a bit unfortunate.

This ensures that page.wait won't unblock too early. As-is, this isn't an issue since active can only be 0 if there are no active OR pending requests. However, with request interception (#930) it's possible to have no active requests and no pending requests - from the http client's point of view - but still have pending-on-intercept requests. An alternative to this would be to undo these changes, and instead change Page.wait to be intercept-aware. That is, Page.wait would continue to block on http activity and scheduled tasks, as well as intercepted requests. However, since the Page doesn't know anything about CDP right now, and it does know about the http client, maybe doing this in the client is fine.

intercept continue and abort feedback First version of headers, no cookies yet

karlseguin · 2025-08-13T02:30:27Z

src/http/Http.zig

+    }
+
+    fn parseHeader(header_str: []const u8) ?struct { name: []const u8, value: []const u8 } {
+        const colon_pos = std.mem.indexOf(u8, header_str, ":") orelse return null;


indexOfScalar removes 1 len check.

karlseguin · 2025-08-13T02:58:26Z

src/browser/page.zig

@@ -467,12 +467,15 @@ pub const Page = struct {
        const owned_url = try self.arena.dupeZ(u8, request_url);
        self.url = try URL.parse(owned_url, null);

+        var headers = try HttpClient.Headers.init();


A lot of internal failure cases will result in this leaking. But I think we can leave it as-is for now.

The difficulty in resource management here is that we need to be careful about allocations up to the point where curl_multi_add_handle is called. After that, it's handled by the perform loop. So a simple errdefer won't do, since you risk double-free depending on where the failure is. A simple fix is to not callperformin makeRequest - making curl_multi_add_handle the last-called function - but I feel like delaying perform until the next tick (especially since there's no reason to think the request can't be sent out immediately) is unnecessary latency.

I want to address it, but, at this point, after we merge everything.

karlseguin reviewed Aug 8, 2025

View reviewed changes

karlseguin force-pushed the nonblocking_libcurl branch from 64f79f2 to 079ce5e Compare August 11, 2025 13:38

sjorsdonkers force-pushed the request_interception branch from 910c715 to 44e951d Compare August 12, 2025 11:34

3# This is a combination of 3 commits.

03694b5

intercept continue and abort feedback First version of headers, no cookies yet

sjorsdonkers force-pushed the request_interception branch from 44e951d to 03694b5 Compare August 12, 2025 11:49

sjorsdonkers added 2 commits August 12, 2025 14:40

Cookies

77eee7f

http_request_fail

a49154a

karlseguin reviewed Aug 13, 2025

View reviewed changes

karlseguin marked this pull request as ready for review August 13, 2025 06:44

karlseguin merged commit 2dc09c7 into nonblocking_libcurl Aug 13, 2025
12 of 14 checks passed

karlseguin deleted the request_interception branch August 13, 2025 06:44

github-actions bot locked and limited conversation to collaborators Aug 13, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

request interception #930

request interception #930

Uh oh!

sjorsdonkers commented Aug 7, 2025 •

edited

Loading

Uh oh!

karlseguin left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

sjorsdonkers commented Aug 8, 2025

Uh oh!

karlseguin commented Aug 8, 2025

Uh oh!

karlseguin Aug 13, 2025

Uh oh!

karlseguin Aug 13, 2025

Uh oh!

Uh oh!

Uh oh!

request interception #930

request interception #930

Uh oh!

Conversation

sjorsdonkers commented Aug 7, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

karlseguin left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

sjorsdonkers commented Aug 8, 2025

Uh oh!

karlseguin commented Aug 8, 2025

Uh oh!

karlseguin Aug 13, 2025

Choose a reason for hiding this comment

Uh oh!

karlseguin Aug 13, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

sjorsdonkers commented Aug 7, 2025 •

edited

Loading